Personalized Tag Predition Boosted by BaggTaming A Case Study of the Hatena Bookmark

نویسندگان

  • Toshihiro Kamishima
  • Masahiro Hamasaki
  • Shotaro Akaho
چکیده

We stated a learning problem, which we call taming, and develop a method for this problem in [神嶌 08b, 神 嶌 08c, Kamishima 08a]. The learner for this taming requests two types of training data sets, tame and wild. The labels of tame data is highly consistent with a target concept, which we actually want to learn. In contrast, wild data are not so well maintained; thus, some labels are consistent with the target concept, while some others are not. Additionally, we assume that wild data are much more abundant than tame data. This assumption is reasonable, because it is generally difficult to provide a large tame data set due to its high maintenance cost. The goal of the taming is to acquire more accurate classifiers by exploiting wild data. To achieve this goal, we developed a BaggTaming method, which is a modified version of bagging [Breiman 96]. We applied this BaggTaming to a personalized tag prediction for the data set collected from the delicious∗1, and showed the effectiveness of our method. We expect that this taming technique would be generally helpful for a tag prediction task on another collaborative tagging service. To check this hypothesis, we performed the tag prediction task on the hatena bookmark∗2, which is one of popular social bookmarks in Japan.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Context Visualization for Social Bookmark Management

We present the design of a new social bookmark manager, named GalViz, as part of the interface of the GiveALink system. Unlike the interfaces of traditional social tagging tools, which usually display information in a list view, GalViz visualizes tags, resources, social links, and social context in an interactive network, combined with the tag cloud. Evaluations through a scenario case study an...

متن کامل

Collaborative Summarization: When Collaborative Filtering Meets Document Summarization

We propose a new way of generating personalized single document summary by combining two complementary methods: collaborative filtering for tag recommendation and graph-based affinity propagation. The proposed method, named by Collaborative Summarization, consists of two steps iteratively repeated until convergence. In the first step, the possible tags of one user on a new document are predicte...

متن کامل

Effect of Velocity and Direction of Airflow on Thermal Comfort and Energy Consumption: Case of New Exclusive Cooling Systems

The application of personalized ventilation systems, particularly in the buildings, is one of the new topics that today presented to it, because of their proper performance in reducing energy consumption and creating better thermal comfort conditions. In the current study, the energy consumption of a personalized cooling system installed in a room, considering the constraint of thermal comfort,...

متن کامل

Generating Useful Network-based Features for Analyzing Social Networks

Recently, many Web services such as social networking services, blogs, and collaborative tagging have become widely popular. Many attempts are being made to investigate user interactions by analyzing social networks among users. However, analyzing a social network with attributional data is often not an easy task because numerous ways exist to define features through aggregation of different ta...

متن کامل

Frequency of Diabetes Mellitus in Skin Tag Patients in Comparison with Control Group

Background & Aims: Diabetes mellitus is one of the most common types of chronic disease leading to serious complications such as blindness, end stage renal disease (ESRD), neuropathy and cardiac disorders. Skin tag is a common benign skin tumor that its association with impaired carbohydrate metabolism has been reported in some studies. The aim of this study was to compare the frequency of diab...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008